Search results for "luonnollinen kieli"
showing 7 items of 7 documents
Recycling a genre for news automation: The production of Valtteri the Election Bot
2020
Abstract The amount of available digital data is increasing at a tremendous rate. These data, however, are of limited use unless converted into a user-friendly form. We took on this task and built a natural language generation (NLG) driven system that generates journalistic news stories about elections without human intervention. In this paper, after presenting an overview of state-of-the-art technologies in NLG, we explain systematically how we identified and then recontextualized the determinant aspects of the genre of an online news story in the algorithm of our NLG software. In the discussion, we introduce the key results of a user test we carried out and some improvements that these re…
Structured query construction via knowledge graph embedding
2020
In order to facilitate the accesses of general users to knowledge graphs, an increasing effort is being exerted to construct graph-structured queries of given natural language questions. At the core of the construction is to deduce the structure of the target query and determine the vertices/edges which constitute the query. Existing query construction methods rely on question understanding and conventional graph-based algorithms which lead to inefficient and degraded performances facing complex natural language questions over knowledge graphs with large scales. In this paper, we focus on this problem and propose a novel framework standing on recent knowledge graph embedding techniques. Our…
Automatic Content Analysis of Computer-Supported Collaborative Inquiry-Based Learning Using Deep Networks and Attention Mechanisms
2020
Computer-supported collaborative inquiry-based learning (CSCIL) represents a form of active learning in which students jointly pose questions and investigate them in technology-enhanced settings. Scaffolds can enhance CSCIL processes so that students can complete more challenging problems than they could without scaffolds. Scaffolding CSCIL, however, would optimally adapt to the needs of a specific context, group, and stage of the group's learning process. In CSCIL, the stage of the learning process can be characterized by the inquiry-based learning (IBL) phase (orientation, conceptualization, investigation, conclusion, and discussion). In this presentation, we illustrate the potential of a…
A Contrastive Evaluation of Word Sense Disambiguation Systems for Finnish
2019
Aiempi saneiden alamerkitysten yksiselitteistämistä käsittelevä työ, kuten monet muut luonnollisen kielen käsittelyyn liittyvät tehtävät, on enimmäkseen keskittynyt englannin kieleen. Vaikka hieman työtä on tehty myös muilla kielillä, mukaan lukien uralilaiset kielet, vertailevaa arviointia suomen kielen saneiden alamerkitysten yksiselitteistämisestä ei ole tähän mennessä julkaistu huolimatta siitä, että tarvittavat leksikaaliset resurssit, erityisesti FinnWordNet, ovat jo pitkään olleet saatavilla. Tämä työ pyrkii korjaamaan tilanteen. Se tarjoaa tuloksia merkittävimpiä lähestymistapoja saneiden alamerkitysten yksiselitteistämiseen edustavista ohjelmista, sisältäen joitakin parhaiten engla…
Multitask deep learning for native language identification
2020
Identifying the native language of a person by their text written in English (L1 identification) plays an important role in such tasks as authorship profiling and identification. With the current proliferation of misinformation in social media, these methods are especially topical. Most studies in this field have focused on the development of supervised classification algorithms, that are trained on a single L1 dataset. Although multiple labeled datasets are available for L1 identification, they contain texts authored by speakers of different languages and do not completely overlap. Current approaches achieve high accuracy on available datasets, but this is attained by training an individua…
From an encyclopedia of Iranian Folklore to an ontology of Iranian folklore
2014
The main resource of the thesis came from 37 years of research work by Ahmad Shamlou, an Iranian poet. The body of research should be transformed from manuscript to digital material. Designing and implementing a semantic ontology for such a folk encyclopedia is a basic and essential part of the thesis. As the material bank is significantly large, semantic marking of the materials in their entirety must be done in a group action and this thesis aims to develop the appropriate framework for the encyclopedia. The model aims to fulfill the philosophy behind Shamlou's approach to the concept of Encyclopedia as an indexical reference to the world by the language and words. The thesis will analyze…
Luonnollisten kielten kääntäminen ja konekäännös - Taustaa, teoriaa ja menetelmiä
2010
López, Elina I. Tietojärjestelmätieteen kandidaatintutkielma / Elina I. López Jyväskylä: Jyväskylän yliopisto, 2010. 36 s. Luonnollisten kielten kääntäminen on olennainen osa ihmisten elämää, erityi-sesti nykyisessä kansainvälisessä maailmassa. Ilman kääntämistä eivät esimer-kiksi yritykset pysty toimimaan. Käännettävät tekstimassat kuitenkin kasvavat kasvamistaan ja käännöstyön nopeuttamiseksi on haettu apua tietokoneista. Konekääntämistä onkin tutkittu ensimmäisten tietokoneiden käyttöönotosta lähtien. Tässä tutkielmassa käsitellään luonnollisia kieliä, perinteistä kääntämistä ja ko-nekäännöksiä. Tutkielmassa käydään läpi luonnollisten kielten jaotteluita ja ominaisuuksia, jotka tekevät …